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The  structure  of  the  multiverse  is  determined  by  information  flow. 

1.  Introduction 

The  idea  that  quantum  theory  is  a  true  description  of  physical  reality  led  Everett 
(1957)  and  many  subsequent  investigators  (e.g.  DeWitt  and  Graham  1973,  Deutsch 
1985,  1997)  to  explain  quantum-mechanical  phenomena  in  terms  of  the  simultaneous 
existence  of  parallel  universes  or  histories.  Similarly  I  and  others  have  explained  the 
power  of  quantum  computation  in  terms  of  'quantum  parallelism'  (many  classical 
computations  occurring  in  parallel).  However,  if  reality  -  which  in  this  context  is 
called  the  multiverse  -  is  indeed  literally  quantum-mechanical,  then  it  must  have  a 
great  deal  more  structure  than  merely  a  collection  of  entities  each  resembling  the 
universe  of  classical  physics.  For  one  thing,  elements  of  such  a  collection  would 
indeed  be  'parallel':  they  would  have  no  effect  on  each  other,  and  would  therefore 
not  exhibit  quantum  interference.  For  another,  a  'universe'  is  a  global  construct  - 
say,  the  whole  of  space  and  its  contents  at  a  given  time  -  but  since  quantum 
interactions  are  local,  it  must  in  the  first  instance  be  local  physical  systems,  such  as 
qubits,  measuring  instruments  and  observers,  that  are  split  into  multiple  copies,  and 
this  multiplicity  must  propagate  across  the  multiverse  at  subluminal  speeds.  And  for 
another,  the  Hilbert  space  structure  of  quantum  states  provides  an  infinity  of  ways 
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of  slicing  up  the  multiverse  into  'universes',  each  way  corresponding  to  a  choice  of 
basis.  This  is  reminiscent  of  the  infinity  of  ways  in  which  one  can  slice  ('foliate')  a 
spacetime  into  spacelike  hypersurfaces  in  the  general  theory  of  relativity.  Given  such 
a  foliation,  the  theory  partitions  physical  quantities  into  those  'within'  each  of  the 
hypersurfaces  and  those  that  relate  hypersurfaces  to  each  other.  In  this  paper  I  shall 
sketch  a  somewhat  analogous  theory  for  a  model  of  the  multiverse. 

The  quantum  theory  of  computation  is  useful  in  this  investigation  because,  as  we 
shall  see,  the  structure  of  the  multiverse  is  determined  by  information  flow,  and  the 
universality  of  computation  ensures  that  by  studying  quantum  computational 
networks  it  is  possible  to  obtain  results  about  information  flow  that  must  also  hold 
for  quantum  systems  in  general.  This  approach  was  used  by  Deutsch  and  Hayden 
(2000)  to  analyse  information  flow  in  the  presence  of  entanglement.  In  that  analysis, 
as  in  this  one,  no  quantitative  definition  of  information  is  required;  the  following 
two  qualitative  properties  suffice: 

•  Property  1:  A  physical  system  S  contains  information  about  a  parameter  b  if 
(though  not  necessarily  only  if)  the  probability  of  some  outcome  of  some 
measurement  on  S  alone  depends  on  b. 

•  Property  2:  A  physical  system  S  contains  no  information  about  b  if  (and  for  present 
purposes  we  need  not  take  a  position  about  'only  if')  there  exists  a  complete 
description  of  S  that  is  independent  of  b. 

I  shall  assume  that  an  entity  S  qualifies  as  a  'physical  system'  if  (but  not  necessarily 
only  if)  it  is  possible  to  store  information  in  S  and  later  to  retrieve  it.  That  is  to  say,  it 
must  be  possible  to  cause  S  to  satisfy  the  condition  of  Property  1  for  containing 
information  about  some  parameter  b.  It  is  implicit  in  this,  and  in  Properties  1  and  2, 
that  b  must  be  capable  of  taking  more  than  one  possible  value,  so  there  must  exist 
some  suitable  sense  in  which  if  S  contained  different  information  it  would  still  be  the 
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same  physical  system.  This  condition  raises  interesting  questions  about  the  counter- 
factual  nature  of  information  which  it  will  not  be  necessary  to  address  here.  It  is  also 
necessary  that  S  be  identifiable  as  the  same  system  over  time.  This  is  particularly 
straightforward  if  S  is  causally  autonomous  -  that  is  to  say,  if  its  evolution  depends 
on  nothing  outside  itself. 

2.  Classical  computers 

Consider  a  classical  reversible  computational  network  containing  N  bits  B1...BN.  A 
specification  of  the  values  bx(t ),...,  bN(t)  of  the  bits  just  after  the  t' th  computational 
step  constitutes  a  complete  description  of  the  computational  state  of  the  network  at 
that  instant.  Given  the  structure  of  the  network  (its  gates,  and  how  the  carriers  of  the 
bits  move  between  them),  this  also  determines  the  computational  state  just  after 
every  other  computational  step.  We  are  not  interested  in  the  network's  state  during 
computational  steps,  nor  in  its  non-computational  degrees  of  freedom,  because  we 
know  that  the  computational  degrees  of  freedom  at  integer  values  of  t  form  a 
causally  autonomous  system,  and  it  is  that  system  which  we  shall  regard  as 
faithfully  modelling,  with  some  finite  but  arbitrarily  high  degree  of  accuracy,  the 
flow  of  information  in  a  classical  system  or  classical  universe. 

Information  flow  in  the  network  is  local  in  the  sense  that  if  some  information  is 
confined  to  a  set  of  bits  C  at  time  t,  then  at  time  t  +  1  that  information  is  confined  to 
bits  that  have  passed  through  the  same  gate  as  some  member  of  C  during  the 
(t  +  1)  'th  computational  step.  In  particular,  if  a  network  consists  of  two  or  more  sub¬ 
networks  that  are  disconnected  for  a  period,  then  information  cannot  flow  from  one 
of  those  sub-networks  to  another  during  that  period.  Where  a  system  S  has  local 
dynamics  -  for  instance,  if  it  is  a  field  governed  by  a  differential  equation  of  motion  - 
and  we  want  to  draw  conclusions  about  information  flow  in  S  by  studying  networks 
that  model  S  to  some  degree  of  approximation,  we  must  consider  only  models  with 
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the  property  that  local  regions  of  S  correspond  to  local  (in  the  above  sense)  regions 
of  the  network. 

If  we  were  to  construct  such  a  network  in  the  laboratory,  then  each  of  the  2N 
possible  bit-sequences  bu...,  bN  would  specify  a  physically  and  computationally 
different  state  of  the  network.  But  if  reality  consisted  of  such  a  network,  that  would 
not  necessarily  be  so,  because  there  would  then  be  no  external  labels,  such  as  spatial 
location,  to  distinguish  one  bit  from  another.  So,  for  instance,  if  the  network 
consisted  of  two  disjoint  sub-networks  with  identical  structures,  containing  bits 
Bv...,BnI 2  and  BN/2+l,...,BN  respectively,  then  any  two  bit-sequences  of  the  form 
P,,. . .,(3^  and  PA//2+1,...,PAr,P1,...,PA//2  would  refer  to  the  same  physical  state.  The  same 
applies  when  we  are  considering  a  hypothetical  network  that  models  information 
flow  in  reality  as  a  whole:  if  the  structure  of  such  network  is  invariant  under  some 
permutation  II  of  its  bits,  then  any  two  bit-sequences  that  are  related  by  n  refer  to 
the  same  state  of  reality. 

Let  us  refer  to  a  bit-sequence  bx(t ),...,  bN(t)  collectively  as  b(t)  (which  can  be  thought 
of  as  the  binary  number  2N ~1bN(t)+ . . .+  2b2(t)  +  b^t)  e  ).  During  each 
computational  step,  the  values  of  the  bits  in  the  network  change  according  to 

b(t+l)=f(b(t))r  (1) 

where  each  f  is  some  invertible  function  from  Z2„  to  itself,  which  characterises  the 
action  of  all  the  gates  through  which  the  bits  pass  during  the  (f+l)'th  computational 
step. 
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Fig.  1:  History  of  a  classical  computation 

The  course  of  such  a  computation  with  initial  state  b( 0)  =  (1  is  shown  schematically 
in  Fig.  1.  The  parts  of  the  graph  in  the  shaded  regions  (i.e.  during  computational 
steps),  and  the  non-integer  values  of  b,  have  no  significance  except  to  indicate  that 
the  motion  of  a  real  computer  would  interpolate  smoothly  between  computational 
states. 

3.  Ensembles  of  classical  computers 

Consider  a  collection  of  M  classical  networks  of  the  kind  described  in  Section  2,  all 
with  the  same  structure  in  terms  of  gates,  but  not  necessarily  all  starting  in  the  same 
initial  state.  One  way  of  describing  such  a  collection  is  as  a  single  network  consisting 
of  M  disconnected  sub-networks.  The  network  has  NM  bits  Bt . . ,BNM  ,  where  B1...BN 
belong  to  the  'first'  sub-network,  BN+1...B2N  to  the  'second',  and  so  on.  But  since  the 
structure  of  the  network  is  invariant  under  any  permutation  of  the  sub-networks,  we 
must  regard  any  pair  of  bit-sequences  of  length  NM  that  are  related  by  such  a 
permutation  as  referring  to  physically  identical  states. 

In  other  words,  when  such  sub-networks  are  in  identical  states,  they  are  fungible .  The 
term  is  borrowed  from  law,  where  it  refers  to  objects,  such  as  banknotes,  that  are 
deemed  identical  for  the  purpose  of  meeting  legal  obligations.  In  physics  we  may 
define  entities  as  fungible  if  they  are  not  merely  deemed  identical  but  are  identical,  in 
the  sense  that  although  they  can  be  present  in  a  physical  system  in  varying  numbers 
or  amounts,  permuting  them  does  not  change  the  physical  state  of  that  system. 
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Fungibility  is  not  new  to  physics.  Many  physical  entities,  such  as  amounts  of  energy, 
are  fungible  even  in  classical  physics:  one  can  add  a  Joule  of  energy  to  a  physical 
system,  but  one  cannot  later  extract  the  same  Joule.  In  quantum  physics  some 
material  objects  -  bosons  -  are  fungible  too:  it  makes  sense  to  ask  how  many 
identical  photons  there  are  in  a  cavity,  and  it  makes  sense  to  add  one  more  of  the 
same  kind  and  then  to  remove  one,  but  it  does  not  make  sense  to  ask  whether  the 
photon  that  has  been  removed  is  or  is  not  the  photon  that  was  previously  added 
(unless  there  was  exactly  one  photon  of  that  kind  present). 

Hence  an  alternative  way  of  describing  our  NM-bit  network  is  as  a  multiset  of  M 
networks,  each  with  N  bits.  A  multiset  is  like  a  set  except  that  some  of  its  elements 
are  fungible.  Each  element  is  associated  with  an  integer,  its  multiplicity ,  which 
specifies  how  many  instances  of  it  appear  in  the  multiset.  In  the  present  case,  if  \lb{t) 
is  the  number  of  sub-networks  that  are  in  the  state  beZlN  at  time  t ,  then  the  state  of 
the  network  at  time  t  is  completely  specified  by  the  2N  multiplicities  {(ib(f)}.  Their 
equation  of  motion  is 

pb(f  +  l)  =  |l/(_1(b)(f).  (2) 

An  ensemble  is  a  limiting  case  of  a  multiset  where  the  total  number  of  elements  M 
goes  to  infinity  but  all  the  proportions  ( lb(t)/M  tend  to  definite  limits.  Considered  as 
a  function  of  b,  lim(p(,  (f)/M)  is  the  distribution  function  for  the  ensemble  at  time  t. 
Henceforth  in  this  paper  I  shall  use  the  term  'ensemble'  for  both  ensembles  and 
multisets,  and  the  term  'multiplicity'  to  denote  both  discrete  multiplicities  and  real¬ 
valued  proportions. 
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Fig.  2:  History  of  an  ensemble  of  reversible  classical  computations 

Fig.  2  illustrates  schematically  a  computation  being  performed  by  an  ensemble  of 
twelve  computers.  Four  of  them  are  performing  the  same  computation  as  the 
computer  referred  to  by  Fig.  1,  with  input  (3.  Three  other  computations  are  being 
performed  in  parallel  with  that  one.  They  have  inputs  a,  y  and  8,  and  are  being 
performed  by  two,  one  and  five  of  the  computers  respectively. 

I  shall  refer  to  a  non-empty  sub-ensemble  in  which  all  the  computers  are  in  the  same 
state  as  a  branch  of  the  ensemble  -  so  for  instance,  the  ensemble  in  Fig.  2  has  four 
branches  throughout  the  computation.  Note  the  following  elementary  properties  of 
branches  under  reversible  classical  physics.  First,  the  total  number  of  branches  is 
conserved:  they  cannot  split,  join,  come  into  existence  or  be  destroyed.  Second,  the 
multiplicity  of  each  branch  is  conserved.  And  third,  none  of  the  branches  affect  each 
other;  that  is  to  say,  the  behaviour  of  each  branch  is  determined  by  its  own  initial 
state  and  f  and  is  therefore  independent  of  how  many  other  branches  are  present 
and  what  their  states  and  multiplicities  are.  These  properties  give  each  branch  a 
well-defined  identity  over  time,  even  though  the  values  of  its  bits  change. 

There  are  such  things  as  fungible  processes  as  well  as  fungible  objects.  Because  each 
of  the  computations  takes  place  within  a  particular  branch  over  time,  and  the  whole 
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ensemble  over  time  is  invariant  under  permutations  of  the  computations  within  a 
branch,  those  computations  are  themselves  fungible. 

In  this  representation  of  our  network  as  an  ensemble  E,  the  equation  of  motion  (2) 
specifies  how  the  multiplicity  of  a  fixed  bit-sequence  b  changes  with  time.  This  is  not 
well  suited  to  the  analysis  of  information  flow  because,  as  illustrated  in  Fig.  2,  the 
information  in  E  flows  entirely  in  branches  which  are  characterised  by  constant 
multiplicities  and  time-varying  bit-sequences.  It  is  possible  to  make  this  manifest  by 
using  an  alternative  representation  of  an  ensemble  of  classical  reversible  systems 
that  bears  the  same  relationship  to  the  standard  one  as  the  Heisenberg  picture  does 
to  the  Schrodinger  picture  in  quantum  theory.  To  motivate  the  representation, 
consider  first  a  list  bi 0)  of  the  computational  states  of  all  the  branches  in  the 
ensemble  at  time  zero,  in  any  order.  For  example,  bi 0)  for  the  ensemble  of  Fig.  2 
could  be  (a.fky.S).  It  will  turn  out  to  be  convenient  to  consider  such  lists  to  be 
vectors,  with  an  algebra  that  I  shall  define  below,  and  I  shall  call  such  vectors  e- 
numbers  by  analogy  with  the  terms  'q-number7  for  quantum  operators  and  'c- 
number7  for  scalars.  For  each  t>  0,  define  the  e-number  bit)  as  a  similar  list,  or 
vector,  of  the  states  of  all  the  branches  at  time  t,  with  the  branches  appearing  in  the 
same  order  as  they  do  in  bi 0).  So  in  the  Fig.  2  case,  bi l)=  (/0(a),/0(P),/0(y),/0(8)),  and 
so  on.  Thus  each  component  of  bit) ,  as  t  varies,  is  the  evolving  state  of  one  particular 
branch  of  the  ensemble.  Since  the  multiplicities  of  branches  are  constant,  we  can  list 
them  as  a  single,  constant  e-number  (1  =  (|ia  (0),(ip  (0),(iy  (0),|i5  (0)). 

The  quantities  (1  and  bit)  together  contain  the  same  information  as  the  {|ifc(/)},  and 
therefore  amount  to  a  complete  description  of  the  state  of  the  ensemble  at  time  t. 

In  general,  e-numbers  for  an  ensemble  of  N-bit  reversible  classical  networks  may  be 
defined  as  follows.  They  are  elements  of  a  2N -dimensional  vector  space  V.  (Lower- 
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dimensional  representations  are  sometimes  possible,  as  in  the  example  above.)  We 
can  express  bit) ,  or  any  other  e-number,  in  terms  of  an  orthonormal  basis  {p b  (/)} 

Ht)=£  bh(t).  (3) 

b 


A  straightforward  representation  for  Pb( 0)  would  be  a  list  of  length  2N,  whose 
( b  +l)'th  element  was  1,  with  the  rest  all  0.  The  equation  of  motion  for  the  ensemble 
is 

h(t+i)=P,em(t)-  (4) 


I  shall  refer  to  Pb(t )  as  the  'projector  for  the  bits  to  take  the  values  b  at  time  t' .  The 
reason  for  this  terminology  is,  as  will  become  apparent  below,  that  these  are  the  e- 
number  analogues  of  quantum  projection  operators  or  Boolean  observables.  Note 
that  they  are  not  projection  operators  on  V,  but  elements  of  V. 


If  M  are  any  c-number  coefficients  and  g  is  any  function  initially  defined  on  c- 
numbers,  we  can  define 


2 


=EsWk(t). 


V  b 


(5) 


From  (3),  (5)  and  (4)  it  follows  that 

kt+i) =/,(ko).  w 

Hence  the  equation  of  motion  (6)  of  the  e-bits  has  the  same  form  as  its  counterpart 
(1)  for  a  single  classical  computer,  with  e-numbers  replacing  integers. 

Apart  from  multiplication  by  a  c-number  in  the  usual  vector-space  manner,  the 
algebra  has  three  further  forms  of  multiplication,  which  may  all  be  defined  by  their 
actions  on  the  projectors:  The  scalar  product  x.y  is  defined  by 
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P.(t).Pt(t)=Sni.  (7) 

The  e-number  product  xy,  which  is  the  e-number  analogue  of  the  classical  or 
quantum  product  of  observables,  is  defined  by 

P,(t)Pt(t)=P,(t)S.t.  (8) 

We  can  also  define  a  unit  e-number  1  =  Hh  Pb(t)  for  any  t,  and  a  zero  e-number  6=0, 
which  have  the  usual  multiplicative  properties  with  respect  to  e-number 
multiplication.  The  tensor  product  x®  y,  which  is  used  for  combining  ensembles, 
could,  for  instance,  be  defined  by 

if(t)®pf(o=p“+i(o,  (9> 

where  the  two  projectors  on  the  left  of  (9)  refer  to  ensembles  E  and  E' ,  the  second 
having  N  bits  per  element,  and  the  projector  on  the  right  refers  to  the  combination 
ExE'  of  those  two  ensembles. 

The  connection  with  the  conventional  representation  is 

Vb(t)=\x.Pb(t).  (10) 

We  can  now  regard  Fig.  2  as  showing  the  history  of  b(t)  in  the  'state'  (1 ,  rather  than 
the  history  of  the  {|iA(t)}.  Note  that  from  a  complete  specification  of  the  algebra 
generated  by  the  e-numbers  (1,  bit)  and  1,  (i.e.  a  means  of  calculating  the  scalar 
products  of  all  the  expressions  that  can  constructed  from  those  e-numbers  by 
addition,  scalar  multiplication  and  e-number  multiplication),  one  can  obtain  the 
projectors  Pbit)  for  all  states  b  that  are  present  in  the  ensemble  at  any  time  t: 

Pb(t)=5(b(t)-b\),  (11) 

where  8  is  an  e-number  version  of  the  Kronecker  delta  function,  defined  by 
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It  then  follows  from  (10)  that  a  complete  description  of  the  ensemble  is  contained 
entirely  in  the  algebra  of  its  e-number  descriptors  (1,  bit)  and  1,  independently  of 
any  particular  representation  of  these  e-numbers  as  2N -tuples. 

4.  Quantum  computers  performing  classical  computations 

The  central  question  addressed  in  this  paper  can  now  be  stated  as  follows:  in  what 
sense,  and  in  what  approximation,  can  a  quantum  computation  be  said  to  contain  an 
ensemble  of  classical  computations? 

Consider  a  quantum  computational  network  containing  N  qubits  Q, , . . . ,  Qv . 
Following  Gottesman  (1999)  and  Deutsch  and  Hayden  (2000),  let  us  represent  each 
qubit  Qk  at  time  t  in  the  Heisenberg  picture  by  a  triple 

bi(t)=(y(f),si?(f),gt(f))  (i3) 

of  2n  x2n  Hermitian  matrices  representing  Boolean  observables  (projection 
operators)  of  Qk,  satisfying 

W(f),br(f)]=0 

f  _  2  (f  ))(l  -  2  (f ))  =  z(l  -  2  bkz  (t )) 

KM)2  =  \M) 

The  Heisenberg  state  |'F)  of  the  network  is  a  constant,  so  we  can  adopt  the 

/  A»  A  /V 

abbreviated  notation  {Xj  =  (VF  |X|  XF)  for  the  expectation  value  of  any  observable  X  of 
the  network. 
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The  effect  of  an  n-qubit  quantum  gate  during  one  computational  step  is  to  transform 
the  3n  matrices  representing  the  n  participating  qubits  into  functions  of  each  other  in 
such  a  way  that  the  relations  (14)  are  preserved. 

Rotations  of  any  descriptor  bkit),  considered  as  a  (matrix-valued)  3-vector  in  the 
Euclidean  x-y-z  space,  are  such  functions.  This  allows  a  large  class  of  possible 
alternative  representations  of  the  qubits,  corresponding  to  the  freedom  to  make  such 
rotations  for  each  qubit  independently  and  then  to  redefine  all  the  x-,  y-  and  z- 
directions.  By  convention  we  use  this  freedom  to  choose  a  representation  (if  one  is 
available)  in  which  the  z-components  bkSt )  are  stabilised  by  any  decoherence  or 
measurement  that  may  occur,  or  more  generally,  in  which  those  components  are 
performing  classical  computations  (see  below).  In  any  case,  we  can  define 

m  =  2N-'bNl(t)  +  ...+2kJt)+bJt),  (15) 

as  for  a  classical  computer,  though  note  that  bit)  is  not  a  complete  specification  of 
the  state  of  the  quantum  computer  at  time  t:  there  are  also  the  other  components  of 
the  descriptors  {bkit)} ,  and  the  Heisenberg  state  |VT).  In  principle,  one  could  change 
the  representation  at  every  computational  step,  but  that  adds  no  generality,  being 
the  same  as  studying  a  different  network  in  a  constant  representation.  It  would  also 
be  possible  to  construct  alternative  representations  that  were  related  to  this  one  by 
more  general  transformations  that  are  not  expressible  as  compositions  of  single¬ 
qubit  transformations.  However,  these  would  not  be  appropriate  in  the  present 
investigation  because  the  'qubits'  in  such  representations  would  not  be  local  in  the 
network,  and  in  order  to  model  information  flow  we  are  using  local  interactions 
(gates)  of  the  network  to  model  local  interactions  in  general  quantum  systems. 

A  quantum  network  (or  sub-network)  is  said  to  be  'performing  a  classical 
computation'  during  the  (f  +  l)'th  computational  step  if  its  bit  +  1)  =  fibit))  for  some 
function  /  (not  necessarily  invertible).  This  occurs  if  and  only  if  all  its  gates  that  act 
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on  qubits  during  that  step  have  classical  analogues  -  including  one-qubit  gates  with 
the  effect 


h(t+ 1)  =  (anything,  anything,  bjt)), 


(16) 


which  is  a  non-trivial  quantum  computation  but  corresponds  to  the  classical  gate 
whose  only  computational  effect  is  a  one-step  delay.  That  is  not  to  say  that  the 
quantum  network  is  a  classical  computer  during  such  a  period:  it  still  has  qubits 
rather  than  bits;  it  (or  at  least,  the  network  as  a  whole)  is  still  undergoing  coherent 
motion;  and  its  computational  state  is  not  specified  by  any  sequence  of  N  binary 
digits. 


The  Toffoli  gate,  which  is  universal  for  reversible  classical  computations,  is  defined 
as  having  the  following  effect  on  the  k' th,  Z'th  and  m'th  bits  of  a  classical  network: 


T(  f+T 

'  Kit) 

bt(t  +1) 

= 

m 

+  l)j 

(0  +  MO  w  -  2  w  wuoj 

It  follows  from  the  results  of  Section  3  that  in  an  ensemble  of  networks  containing  a 
Toffoli  gate,  its  effect  has  the  same  functional  form  as  (17),  with  e-numbers  replacing 
c-numbers: 


'bk(t  +  l) 

(  v  3 

hit) 

b,(t  + 1) 

= 

hit)  ^ 

b,„(t  + 1)^ 

v  bm  it)  +  bk  it)  b,  it)  -  2  bk  it)  hit)  bm  ( t)^ 

(18) 


Compare  this  with  the  effect  of  the  quantum  version  of  the  Toffoli  gate: 


bk(t+l) 

f( 

b,{t+ 1) 

-( 

h(t+ 1) 

'  he +  blzbmx  2 bkxbh  bmx , 


b 

umx  * 


k+Kk-2  k  ) 

k+LL-2bJ„L,  k  )• 

K + kk  -  A-kK,  k + kk  - 2  kkkj 

'  > 


(19) 
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For  the  sake  of  brevity,  the  parameter  t  has  been  suppressed  from  all  the  matrices  on 
the  right  of  (19).  It  is  easily  verified  that  the  conditions  (14)  are  preserved  by  this 
transformation.  Notice  that  the  z-components  bkz(t  +  l),  blz(t+l),  bmz(t  +  l)  of  the 
descriptors  of  the  qubits  emerging  from  the  gate  (third  column  on  the  right  of  (19)) 
depend  only  on  the  z-components  bkz(t),bIz(t),  bmz(t)  of  the  descriptors  of  the  qubits 
entering  the  gate.  Notice  also  that  these  z-components  commute  with  each  other  and 
that  their  equation  of  motion  has  the  same  functional  form  as  that  of  the 
corresponding  ensemble  of  classical  computers  (18).  Given  the  universality  of  the 
Toffoli  gate,  all  these  properties  must  hold  whenever  a  quantum  network,  or  any 
part  of  it,  performs  a  classical  computation.  In  other  words,  whenever  any  quantum 
network  (including  a  sub-network  of  another  network)  is  performing  a  classical 
computation/,  the  matrices  )}  for  that  network  evolve  independently  of  all  its 
other  descriptors.  Moreover,  under  the  following  correspondence 


Ensemble 

Quantum 

Kit) 

<-> 

kzit) 

bit ) 

<-> 

bit) 

V 

F 

<-> 

ITOCFI 

1 

<-> 

i 

hit) 

Phit)  =  \b-,t){b\t\ 

XY 

<-> 

TrXY 

XY 

<-> 

XY 

X®  Y 

<-> 

X  ®Y 

the  commuting  algebra  of  these  matrices  forms  a  faithful  representation  of  the 
algebra  of  e-numbers  describing  an  ensemble  of  classical  networks  performing  /  In 

A  /V  /V 

(20),  | b\t)  is  the  eigenvalue-^  eigenstate  of  bkz(t),  and  X  and  Y  are  the  same 
functions  of  the  {bkz(t)}  as  X  and  Y  respectively  are  of  the  {/y  (f )} . 

We  also  have  bit  +  1)  =  ft{bit)),  the  analogue  of  (6).  Thus  Fig.  2,  showing  the  course  of 
an  ensemble  of  classical  computations,  could  equally  well  be  a  graph  of  the 


14 


David  Deutsch 


The  Structure  of  the  Multiverse 


quantities  \Pb(ti)  in  a  quantum  computer  that  was  performing  the  same  classical 
computation  as  that  ensemble.  Note  also  that  while  the  quantities  (1  ,Pb(t)  form  a 
complete  description  of  the  ensemble  of  classical  computations,  the  \Ph (tfj  are  not  a 
complete  description  of  the  state  of  the  quantum  computation. 

Thus  in  any  sub-network  R  of  a  quantum  computational  network  where  a  reversible 
classical  computation  is  under  way,  half  the  parameters  describing  R  are  precisely 
the  descriptors  of  an  ensemble  of  classical  networks.  It  is  half  the  parameters 
because,  from  (14),  any  two  of  the  three  components  of  {bk(t)}  determine  the  third. 
This  does  not  imply  that  such  a  subsystem  constitutes  half  the  region  of  the 
multiverse  in  which  R  exists.  Proportions  in  the  latter  sense  -  which  formally  play 
the  role  of  probabilities  under  some  circumstances,  as  shown  in  Deutsch  (1999)  -  are 
determined  by  the  Heisenberg  state  as  well  as  the  observables,  and  do  not  concern 
us  here  because  the  present  discussion  is  not  quantitative. 

The  other  half  of  the  parameters,  say  the  {bkx(t)}),  contain  information  that  is 
physically  present  in  R  (it  can  affect  subsequent  measurements  performed  on  R 
alone)  but  cannot  reach  the  ensemble  (the  descriptors  of  the  ensemble  being 
independent  of  that  information).  But  the  reverse  is  not  true:  as  (19)  shows, 
information  can  reach  the  quantum  degrees  of  freedom  from  the  ensemble. 

The  proposition  that  parts  of  the  multiverse  have  the  same  description  as  an 
ensemble  with  given  properties  is  not  quite  the  same  as  the  proposition  that  such  an 
ensemble  is  actually  present  in  those  parts  of  the  multiverse,  for  the  description 
might  refer  to  entities  that  are  not  present  in  addition  to  those  that  are.  In  particular, 
an  ensemble  has  an  alternative  interpretation  as  a  notional  collection,  only  one 
member  of  which  is  physically  real,  with  the  multiplicity  of  a  given  branch 
representing  the  probability  that  the  properties  of  that  branch  were  the  ones 
prepared  in  the  real  system  at  the  outset,  by  some  stochastic  process.  However,  no 
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such  interpretation  is  possible  if  the  branches  affect  each  other,  as  they  do  in  general 
quantum  phenomena,  and  in  quantum  computations  in  particular  (see  Benjamin 
2001). 

5.  Quantum  computations 

When  a  quantum  computational  network  is  performing  a  general  computation,  it 
need  not  be  the  case  that  the  descriptors  of  any  part  of  the  network  over  two  or  more 
computational  steps  constitute  a  representation  of  an  evolving  e-algebra.  There  need 
exist  no  functions  ft  and  no  choice  of  the  'z-directions'  for  defining  the  bkzit)  and 
hence  bit) ,  such  that  bit  + 1)=  ft  {bit)),  so  the  conditions  discussed  in  Section  3  for 
branches  to  have  an  identity  over  time  need  not  hold.  At  each  instant  t,  it  is  still 
possible  to  extract  a  set  of  numbers  \Pbit ) )  from  the  description  of  the  network  at 
time  t,  and  these  still  constitute  a  partition  of  unity,  and  still  indicate  which  of  the 
eigenvalues  b  of  the  observable  bit)  are  present  in  the  multiverse  at  time  t  (in  the 
sense  that  if  b(t)  were  measured  immediately  after  time  t,  the  possible  outcomes 
would  be  precisely  the  values  for  which  \Pbit)  /  ^0).  But  although  the  physical 
evolution  is  of  course  always  continuous,  there  is  in  general  no  way  of  'connecting 
up  the  dots'  in  a  graph  of  the  quantities  \Pbit)  /  against  b  and  t  that  would  correctly 
represent  the  flow  of  information.  Hence  there  exists  no  entity  (such  as  a  'branch'  or 
'universe'),  associated  with  only  one  of  the  values  b,  that  can  be  identified  as  a 
physical  system  over  time. 
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Fig.  3:  History  of  a  quantum  computation 


In  a  typical  quantum  algorithm,  as  illustrated  schematically  in  Fig.  3,  the  qubits  first 
undergo  a  non-classical  unitary  transformation  U,  then  a  reversible  classical 
computation,  and  finally  another  unitary  transformation  which  is  often  the  inverse 
U"1  of  the  first  one.  Despite  the  fact  that  the  branches  lose  their  separate  identities 
during  the  periods  of  the  quantum  transformations  U  and  U'1,  we  can  still  track  the 
flow  of  information  reasonably  well  in  terms  of  ensembles:  For  f<— 1,  there  is  a 
homogeneous  ensemble,  in  all  elements  of  which  the  computer  is  prepared  with  the 
input  (3.  For  -1  <  t  <  0,  this  region  of  the  multiverse  does  not  resemble  an  ensemble:  it 
has  a  more  complicated  structure,  but  the  quantum  computer  as  a  whole  does  still 
contain  the  information  that  the  input  was  (I.  For  0  <t<  3  an  ensemble  is  present 
again,  this  time  with  four  branches.  The  information  about  b  may  no  longer  be 
wholly  present  in  that  ensemble;  some  or  all  of  it  may  be  in  the  other  half  of  the 
computer's  degrees  of  freedom.  For  t  >3  the  story  is  similar  to  that  for  t  <0,  but  in 
reverse  order,  so  that  finally  there  is  a  homogeneous  ensemble  with  all  elements 
holding  the  value  g(P ) . 

Consider  a  quantum  computation  whose  (t  +  1)  'th  step  has  the  effect:  'if  qubit  N  is  1, 
evaluate  the  invertible  function  f  on  qubits  1  to  N  —  1,  and  otherwise  perform  the 
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unitary  transformation  Uf  on  those  qubits'.  In  other  words,  during  the  (t  +  1)  'th  step 
the  computer  performs  the  transformation: 


2N_1-1 


v,=K  E  \mfb\+(i-Qvr 


(21) 


6=0 


on  all  N  qubits.  (Since  bNz  does  not  change  during  this  process  we  can  drop  its 
parameter  t.)  If  the  Uf  do  not  represent  classical  computations  then  clearly  the 
network  as  a  whole  is  not  performing  a  classical  computation  unless  \bN,/ =  1. 
Nevertheless,  it  is  still  the  case  that  some  of  the  descriptors  of  this  network  -  only 
about  a  quarter  of  them,  this  time,  namely  the  {bNzbkz(t)}  -  are  those  of  a  causally 
autonomous  ensemble  of  classical  computers,  which,  by  the  argument  above,  means 
that  such  an  ensemble  is  present.  Half  the  descriptors,  say  the 
{(i- bNz)bkz(t)]\j{{i  ~  bNz)b kx(t)] ,  do  form  a  causally  autonomous  system  but  do  not 
form  a  representation  of  an  e-algebra,  while  the  remaining  quarter,  say,  the 
{bNzbkx(t)} ,  are  neither  causally  autonomous  nor  (therefore)  form  a  representation  of 
an  e-algebra.  Thus  this  system  has  the  following  information-flow  structure:  it 
consists  of  two  subsystems  between  which  information  does  not  flow.  One  of  them 
is  performing  a  quantum  computation  and  cannot  be  further  analysed  into 
autonomous  subsystems;  the  other  contains  both  an  ensemble  of  2'  1  classical 
computations  and  a  further  system  that  can  not  be  analysed  into  autonomous 
subsystems;  moreover,  information  can  reach  it  from  the  ensemble  but  not  vice- 
versa. 


In  this  network,  the  individual  branches  of  the  ensemble  whose  e-number  algebra  is 
represented  by  the  matrices  {bNzbkz(t)} ,  qualify  as  physical  systems  according  to  the 
criteria  of  Section  1  because,  for  instance,  if  X(t )  is  any  observable  on  the  network  at 
time  t  and  0  <k<  2N_1,  a  measurement  of  the  observable  bNz(t)Pk(t)X(t)Pk(t)bNz(t)  is  a 
measurement  on  one  such  branch  alone  -  the  one  in  which  the  classical  computation 
is  taking  place  and  all  the  classical  computers  in  the  branch  are  in  state  k  at  time  t. 
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The  'controlled-not'  gate,  or  measurement  gate,  which  has  the  effect 


\b„M  +l)j 

/  A  A  A  A  /V  /V  A  A  A  \  ' 

(b  +b  -  2b  b  ,  b  +b  -2b  b  ,  b 

y  nx  mx  nx  mx/  nx  my  nx  my  /  mzj 

[Kit+D]' 

/  ■*•  /v  /v  yv/v  yv  yv  yvyv\ 

lb  ,  b  +b  -2b  b  ,  b  +b  -2b  b 

y  nx f  ny  mz  ny  mz r  nz  mz  nz  mz ) 

(22) 


where  again  the  parameter  t  has  been  suppressed  from  all  the  matrices  on  the  right 
of  the  equation,  can  be  used  to  model  the  effects  of  measurement  and  decoherence. 
Qm  is  known  as  the  'control'  qubit  and  Qn  the  'target'  qubit.  Because  this  is  a 
reversible  classical  computation  (or  rather,  the  quantum  analogue  of  one),  the  last 
(z-)  column  of  (22)  again  depends  only  on  the  z-components  bmz(t)  and  bnz(t)  of  the 
descriptors  of  the  qubits  entering  the  gate.  Furthermore,  the  z-component  of  the 
descriptor  of  the  control  qubit  is  unaffected  by  the  action  of  the  measurement  gate 
(i.e.  bmz(t  + 1)=  bmz(t)).  Therefore,  if  some  sub-network  of  a  quantum  network 
performs  a  classical  computation  for  a  period  if  the  network  is  isolated,  and  then  it  is 
run  with  some  or  all  of  the  observables  {bkz}  being  repeatedly  measured  between 
computational  steps,  it  will  still  perform  the  same  classical  computation  and  will 
contain  an  ensemble  identical  to  that  which  it  would  contain  if  it  were  isolated 
(though  its  other  descriptors  will  be  very  different).  Since  decoherence  can  be 
regarded  as  a  process  of  measurement  of  a  quantum  system  by  its  environment,  the 
same  conclusion  holds  in  the  presence  of  decoherence.  It  also  holds,  by  trivial 
extension,  if  the  classical  computation  is  irreversible,  since  an  irreversible  classical 
computation  is  simply  a  reversible  classical  computation  in  which  some  of  the 
information  leaves  (becomes  absent  from)  the  sub-network  under  consideration. 

Since  a  generic  quantum  computational  network  does  not  perform  anything  like  a 
classical  computation  on  a  substantial  proportion  of  its  qubits  for  many 
computational  steps,  it  may  seem  that  when  we  extend  the  above  conclusions  to  the 
multiverse  at  large,  we  should  expect  parallelism  (ensemble-like  systems)  to  be 
confined  to  spatially  and  temporally  small,  scattered  pockets.  The  reason  why  these 
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systems  in  fact  extend  over  the  whole  of  spacetime  with  the  exception  of  some  small 
regions  (such  as  the  interiors  of  atoms  and  quantum  computers),  and  why  they 
approximately  obey  classical  laws  of  physics,  is  studied  in  the  theory  of  decoherence 
(see  Zurek  1981,  Hartle  1991).  For  present  purposes,  note  only  that  although  most  of 
the  descriptors  of  physical  systems  throughout  spacetime  do  not  obey  anything  like 
classical  physics,  the  ones  that  do,  form  a  system  that,  to  a  good  approximation,  is 
not  only  causally  autonomous  but  can  store  information  for  extended  periods  and 
carry  it  over  great  distances.  It  is  therefore  that  system  which  is  most  easily 
accessible  to  our  senses  -  indeed,  it  includes  all  the  information  processing 
performed  by  our  sense  organs  and  brains.  It  has  the  approximate  structure  of  a 
classical  ensemble  comprising  'the  universe'  that  we  subjectively  perceive  and 
participate  in,  and  other  'parallel'  universes. 

In  Section  1  I  mentioned  that  the  theory  presented  here  does  roughly  the  same  job 
for  the  multiverse  as  the  theory  of  foliation  into  spacelike  hypersurfaces  does  for 
spacetime  in  general  relativity.  There  are  strong  reasons  to  believe  that  this  must  be 
more  than  an  analogy.  It  is  implausible  that  the  quantum  theory  of  gravity  will 
involve  observables  that  are  functions  of  a  c-number  time.  Instead,  time  must  be 
associated  with  entanglement  between  clock-like  systems  and  other  quantum 
systems,  as  in  the  model  constructed  by  Page  and  Wootters  (1983),  in  which  different 
times  are  seen  as  special  cases  of  different  universes.  Hence  the  theory  presented 
here  and  the  classical  theory  of  foliation  must  in  reality  be  two  limiting  cases  of  a 
single,  yet-to-be-discovered  theory  -  the  theory  of  the  structure  of  the  multiverse 
under  quantum  gravity. 
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